Abstract

In this paper, we prove that, in dimension one, the Poincaré inequality is equivalent to a new transport-chi-square inequality linking the square of the quadratic Wasserstein distance with the chi-square pseudo-distance. We also check tensorization of this transport-chi-square inequality.

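For $q \ge 1$, the Wasserstein distance with index $q$ between two probability measures $\mu$ and $\nu$ on $\mathbb{R}^d$ is denoted by

$$W_q(\mu,\nu) = \inf_{\gamma} \left( \int_{\mathbb{R}^d\times\mathbb{R}^d} |x-y|^q \, d\gamma(x,y) \right)^{1/q} \tag{0.1}$$

where the infimum is taken over all probability measures $\gamma$ on $\mathbb{R}^d\times\mathbb{R}^d$ with respective marginals $\mu$ and $\nu$. We also introduce the relative entropy and the chi-square pseudo-distance

$$H(\nu|\mu) = \begin{cases} \int_{\mathbb{R}^d} \ln\left(\frac{d\nu}{d\mu}(x)\right) d\nu(x) & \text{if } \nu \text{ is absolutely continuous w.r.t. } \mu \\ +\infty & \text{otherwise} \end{cases}$$

$$\chi_2^2(\nu|\mu) = \begin{cases} \int_{\mathbb{R}^d} \left(\frac{d\nu}{d\mu}(x) - 1\right)^2 d\mu(x) & \text{if } \nu \text{ is absolutely continuous w.r.t. } \mu \\ +\infty & \text{otherwise} \end{cases}$$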

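Next, we make precise the inequalities that will be discussed in the paper.

Definition 0.1 The probability measure $\mu$ on $\mathbb{R}^d$ is said to satisfy

• the Poincaré inequality $\mathcal{P}(C)$ with constant $C$ if

$$\forall \varphi : \mathbb{R}^d \to \mathbb{R} \ C^1 \text{ with a bounded gradient}, \quad \int_{\mathbb{R}^d} \varphi^2(x)\,d\mu(x) - \left(\int_{\mathbb{R}^d} \varphi(x)\,d\mu(x)\right)^2 \le C \int_{\mathbb{R}^d} |\nabla\varphi(x)|^2\,d\mu(x),$$

• the transport-chi-square inequality $\mathcal{T}_\chi(C)$ with constant $C$ if

$$\forall \nu \text{ probability measure on } \mathbb{R}^d, \quad W_2(\mu,\nu) \le \sqrt{C\,\chi_2^2(\nu|\mu)},$$

• the log-Sobolev inequality $\mathcal{LS}(C)$ with constant $C$ if, for all $\varphi : \mathbb{R}^d \to \mathbb{R}$ $C^2$ and compactly supported,

$$\int_{\mathbb{R}^d} \varphi^2(x)\ln(\varphi^2(x))\,d\mu(x) - \int_{\mathbb{R}^d} \varphi^2(x)\,d\mu(x)\,\ln\left(\int_{\mathbb{R}^d} \varphi^2(x)\,d\mu(x)\right) \le C \int_{\mathbb{R}^d} |\nabla\varphi(x)|^2\,d\mu(x),$$

• the transport-entropy inequality $\mathcal{T}_H(C)$ with constant $C$ if

$$\forall \nu \text{ probability measure on } \mathbb{R}^d, \quad W_2(\mu,\nu) \le \sqrt{C\,H(\nu|\mu)}.$$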

According to [8], the log-Sobolev inequality is stronger than the transport-entropy inequality, which is itself stronger than the Poincaré inequality; more precisely, $\mathcal{LS}(C) \Rightarrow \mathcal{T}_H(C) \Rightarrow \mathcal{P}(C/2)$. The transport-entropy inequality is strictly weaker than the log-Sobolev inequality (see [21 Hj for examples of one-dimensional probability measures $\mu$ satisfying the transport-entropy inequality but not the log-Sobolev inequality) and is strictly stronger than the Poincaré inequality (see for example [4] Theorem 1.7).

On the other hand, the inequality $x\ln(x) \le (x-1) + (x-1)^2$ implies $H(\nu|\mu) \le \chi_2^2(\nu|\mu)$ and therefore $\mathcal{T}_H(C) \Rightarrow \mathcal{T}_\chi(C)$. The transport-entropy inequality thus implies both the transport-chi-square and Poincaré inequalities. The relation between the two latter is therefore a natural question. It turns out that, by an easy adaptation of the linearization argument in [8], the transport-chi-square inequality implies the Poincaré inequality. Moreover, in dimension $d = 1$, we are able to prove the converse implication so that both inequalities are equivalent. Last, we prove tensorization of the transport-chi-square inequality.
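
The step from the pointwise inequality to $H(\nu|\mu) \le \chi_2^2(\nu|\mu)$ can be spelled out as follows. This is only a sketch: the notation $\rho = \frac{d\nu}{d\mu}$ is introduced here for convenience, and $\nu$ is assumed absolutely continuous with respect to $\mu$ (otherwise both sides are $+\infty$).

```latex
% Assumption: \rho = d\nu/d\mu, so that \int \rho \, d\mu = 1.
% Pointwise bound: \ln x \le x - 1 gives x \ln x \le x(x-1) = (x-1) + (x-1)^2.
\begin{align*}
H(\nu|\mu) &= \int \rho \ln \rho \, d\mu \\
           &\le \int (\rho - 1)\, d\mu + \int (\rho - 1)^2 \, d\mu \\
           &= 0 + \chi_2^2(\nu|\mu).
\end{align*}
% Hence W_2^2(\mu,\nu) \le C\,H(\nu|\mu) \le C\,\chi_2^2(\nu|\mu),
% i.e. \mathcal{T}_H(C) \Rightarrow \mathcal{T}_\chi(C).
```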

Acknowledgement: I thank Arnaud Guillin for fruitful discussions and in particular for pointing out to me the implication $\mathcal{T}_\chi(C) \Rightarrow \mathcal{P}(C)$ and the interest of tensorization.

1 Main results 

Theorem 1.1 $\forall d \ge 1$, $\mathcal{T}_\chi(C) \Rightarrow \mathcal{P}(C)$. Moreover, when $d = 1$, $\mathcal{P}(C) \Rightarrow \mathcal{T}_\chi(32C)$ and the transport-chi-square and Poincaré inequalities are equivalent.

Before proving Theorem 1.1, we state our second main result, dedicated to the tensorization property of the transport-chi-square inequality. Its proof is postponed to Section 4.

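Theorem 1.2 If $\mu_1$ and $\mu_2$ are probability measures on $\mathbb{R}^{d_1}$ and $\mathbb{R}^{d_2}$ respectively satisfying $\mathcal{T}_\chi(C_1)$ and $\mathcal{T}_\chi(C_2)$, then the measure $\mu_1 \otimes \mu_2$ satisfies $\mathcal{T}_\chi\left(\left(C_1 + C_2\left(1 + \sqrt{(3d_2+2)d_2}\right)\right) \wedge \left(C_2 + C_1\left(1 + \sqrt{(3d_1+2)d_1}\right)\right)\right)$.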

Remark 1.3 According to Proposition 8.4.1 [1], if $\mu_1$ and $\mu_2$ respectively satisfy $\mathcal{T}_H(C_1)$ and $\mathcal{T}_H(C_2)$, then $\mu_1 \otimes \mu_2$ satisfies $\mathcal{T}_H(C_1 \vee C_2)$. The constant that we obtain in the tensorization of the transport-chi-square inequality is larger than $C_1 \vee C_2$.
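
To get a feeling for how much larger the tensorized constant is, the following minimal sketch evaluates the constant of Theorem 1.2, namely the minimum of $C_1 + C_2(1+\sqrt{(3d_2+2)d_2})$ and $C_2 + C_1(1+\sqrt{(3d_1+2)d_1})$. The function name `tensorized_constant` is hypothetical and only serves this illustration.

```python
import math


def tensorized_constant(c1, c2, d1, d2):
    """Constant of Theorem 1.2 for mu1 (x) mu2 (illustrative helper)."""
    # Bound obtained by conditioning on the first coordinate block.
    first = c1 + c2 * (1.0 + math.sqrt((3 * d2 + 2) * d2))
    # Bound obtained after exchanging the roles of mu1 and mu2.
    second = c2 + c1 * (1.0 + math.sqrt((3 * d1 + 2) * d1))
    return min(first, second)


# Two one-dimensional factors with unit constants.
print(tensorized_constant(1.0, 1.0, 1, 1), max(1.0, 1.0))
```

For $C_1 = C_2 = 1$ and $d_1 = d_2 = 1$ the constant equals $2 + \sqrt{5}$, to be compared with $C_1 \vee C_2 = 1$ in the transport-entropy case.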

The proof of the one-dimensional implication $\mathcal{P}(C) \Rightarrow \mathcal{T}_\chi(32C)$ in Theorem 1.1 relies on the two next propositions, the proofs of which are respectively postponed to Sections 2 and 3. When $d = 1$, we denote by $F(x) = \mu((-\infty, x])$ and $G(x) = \nu((-\infty, x])$ the cumulative distribution functions of the probability measures $\mu$ and $\nu$. The left-continuous pseudo-inverse of $G$ (resp. $F$) is defined by $G^{-1} : (0,1) \ni u \mapsto \inf\{x \in \mathbb{R} : G(x) \ge u\}$ (resp. $F^{-1}(u) = \inf\{x \in \mathbb{R} : F(x) \ge u\}$) and satisfies

$$\forall x \in \mathbb{R}, \ \forall u \in (0,1), \quad x < G^{-1}(u) \Leftrightarrow G(x) < u. \tag{1.1}$$

When $\mu$ (resp. $\nu$) admits a density w.r.t. the Lebesgue measure, this density is denoted by $f$ (resp. $g$). Moreover, the optimal coupling in (0.1) is given by $\gamma = du \circ (F^{-1}, G^{-1})^{-1}$ where $du$ denotes the Lebesgue measure on $(0,1)$, so that $W_q^q(\mu,\nu) = \int_0^1 |F^{-1}(u) - G^{-1}(u)|^q\,du$ (see [9] p107-109). We take advantage of this optimal coupling to work with the cumulative distribution functions and check the following proposition. In higher dimensions, far less is known about the optimal coupling and this is the main reason why we have not been able to check whether the Poincaré inequality implies the transport-chi-square inequality.
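
For two empirical measures built from samples of the same size $n$, the quantile functions are step functions and the formula $W_q^q(\mu,\nu) = \int_0^1 |F^{-1}(u) - G^{-1}(u)|^q\,du$ reduces to an average over sorted samples. The sketch below illustrates this special case only; the helper name `wasserstein_pow` and the equal-size restriction are assumptions made for the example.

```python
def wasserstein_pow(xs, ys, q=2):
    """W_q^q between the empirical measures of two equal-size samples.

    Sorting realises the quantile coupling u -> (F^{-1}(u), G^{-1}(u)):
    on ((i-1)/n, i/n] both quantile functions equal the i-th order statistic.
    """
    if len(xs) != len(ys):
        raise ValueError("samples must have the same size")
    xs_sorted = sorted(xs)
    ys_sorted = sorted(ys)
    n = len(xs_sorted)
    return sum(abs(x - y) ** q for x, y in zip(xs_sorted, ys_sorted)) / n


# Translating a sample by 3 moves every quantile by 3, so W_2^2 = 9.
print(wasserstein_pow([0.5, -1.0, 2.0], [3.5, 2.0, 5.0]))
```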

Proposition 1.4 If a probability measure $\mu$ on the real line admits a positive probability density $f$, then, for any probability measure $\nu$ on $\mathbb{R}$,

$$W_2^2(\mu,\nu) \le 4 \int_{\mathbb{R}} \frac{(F-G)^2}{f}(x)\,dx. \tag{1.2}$$



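Remark 1.5 • One deduces that $W_1^2(\mu,\nu) \le 4\int_{\mathbb{R}} \frac{(F-G)^2}{f}(x)\,dx$. Notice that since, by (1.1) and Fubini's theorem,

$$\begin{aligned} W_1(\mu,\nu) &= \int_0^1 \int_{\mathbb{R}} \left(1_{\{F^{-1}(u) \le x < G^{-1}(u)\}} + 1_{\{G^{-1}(u) \le x < F^{-1}(u)\}}\right) dx\,du \\ &= \int_{\mathbb{R}} \int_0^1 \left(1_{\{G(x) < u \le F(x)\}} + 1_{\{F(x) < u \le G(x)\}}\right) du\,dx = \int_{\mathbb{R}} |F(x) - G(x)|\,dx, \end{aligned}$$

the stronger bound

$$W_1^2(\mu,\nu) = \left(\int_{\mathbb{R}} \frac{|F-G|}{\sqrt{f}}(x) \times \sqrt{f(x)}\,dx\right)^2 \le \int_{\mathbb{R}} \frac{(F-G)^2}{f}(x)\,dx$$

is a consequence of the Cauchy-Schwarz inequality.

• It is not possible to control $\int_{\mathbb{R}} \frac{(F-G)^2}{f}(x)\,dx$ in terms of $W_2^2(\mu,\nu)$. Indeed for $f(x) = \frac{1}{2}e^{-|x|}$ and $d\nu(x) = \frac{1}{2}e^{-|x-m|}dx$, one has $W_2^2(\mu,\nu) = m^2$, $G(x) = \frac{e^{x-m}}{2}1_{\{x<m\}} + \left(1 - \frac{e^{m-x}}{2}\right)1_{\{x \ge m\}}$ and for $m > 0$,

$$\int_{\mathbb{R}} \frac{(F-G)^2}{f}(x)\,dx \ge \int_m^{+\infty} \frac{(F-G)^2}{f}(x)\,dx = \frac{(e^m-1)^2 e^{-m}}{2} = 2\sinh^2\left(\frac{m}{2}\right).$$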
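
The second point of the remark can be checked numerically. For $x \ge m > 0$ a direct computation gives $(F-G)(x) = \frac{(e^m-1)e^{-x}}{2}$, hence $\frac{(F-G)^2}{f}(x) = \frac{(e^m-1)^2e^{-x}}{2}$. The sketch below integrates this tail with a midpoint rule and compares it with $W_2^2(\mu,\nu) = m^2$; the function name, the truncation length and the number of steps are arbitrary choices made for the illustration.

```python
import math


def weighted_l2_tail(m, length=40.0, steps=200000):
    """Midpoint rule for the integral of (F - G)^2 / f over [m, m + length].

    Assumes f(x) = exp(-|x|)/2, g(x) = exp(-|x - m|)/2 and m >= 0, so that
    both survival functions are explicit for x >= m.
    """
    h = length / steps
    total = 0.0
    for i in range(steps):
        x = m + (i + 0.5) * h
        surv_f = 0.5 * math.exp(-x)      # 1 - F(x)
        surv_g = 0.5 * math.exp(m - x)   # 1 - G(x)
        diff = surv_g - surv_f           # equals F(x) - G(x)
        total += diff * diff / (0.5 * math.exp(-x)) * h
    return total


for m in (1.0, 5.0, 10.0):
    closed_form = (math.exp(m) - 1.0) ** 2 * math.exp(-m) / 2.0
    print(m, m * m, weighted_l2_tail(m), closed_form)
```

The weighted integral grows exponentially in $m$ while $W_2^2(\mu,\nu)$ only grows quadratically, which is why no bound in the reverse direction can hold.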



Next, when the probability measure $\mu$ on the real line admits a positive probability density satisfying a tail assumption known to be equivalent to the Poincaré inequality (see Theorem 6.2.2 [1]), we are able to control the right-hand-side of (1.2) in terms of $\chi_2^2(\nu|\mu)$.



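Proposition 1.6 Let $f(x)$ be a positive probability density on the real line with cumulative distribution function $F(x) = \int_{-\infty}^x f(y)\,dy$ and median $m$ such that

$$b \overset{\text{def}}{=} \sup_{x \ge m} \int_x^{+\infty} f(y)\,dy \int_m^x \frac{dy}{f(y)} \ \vee \ \sup_{x \le m} \int_{-\infty}^x f(y)\,dy \int_x^m \frac{dy}{f(y)} < +\infty. \tag{1.3}$$

Then for any probability density $g$ on the real line with cumulative distribution function $G(x) = \int_{-\infty}^x g(y)\,dy$,

$$\int_{\mathbb{R}} \frac{(F-G)^2}{f}(x)\,dx \le 4b \int_{\mathbb{R}} \frac{(f-g)^2}{f}(x)\,dx. \tag{1.4}$$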
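
As an illustration of how the constant $b$ in (1.3) is evaluated, here is a worked example for the symmetric exponential density $f(x) = \frac{1}{2}e^{-|x|}$, whose median is $m = 0$. The choice of this density is ours and is only meant as a sanity check of the definition.

```latex
% Worked example (assumption: f(x) = e^{-|x|}/2, median m = 0).
\begin{align*}
\int_x^{+\infty} f(y)\,dy &= \tfrac{1}{2}e^{-x},
\qquad
\int_0^x \frac{dy}{f(y)} = 2\,(e^{x}-1)
\qquad (x \ge 0), \\
\int_x^{+\infty} f(y)\,dy \int_0^x \frac{dy}{f(y)} &= 1 - e^{-x} \;\le\; 1, \\
b &= \sup_{x \ge 0}\,\bigl(1 - e^{-x}\bigr) = 1
\quad \text{(the supremum over } x \le 0 \text{ is the same by symmetry)}.
\end{align*}
% With b = 1, inequality (1.4) reads
% \int (F-G)^2 / f \, dx \le 4 \int (f-g)^2 / f \, dx .
```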
Remark 1.7 • The combination of these two propositions implies that any probability measure $\mu$ on the real line admitting a positive density $f$ such that $b < +\infty$ satisfies $\mathcal{T}_\chi(16b)$.

• Proposition 1.6 is a generalization of the last assertion in Lemma 2.3 [6] where $f$ is restricted to the class of probability densities $f_\infty$ solving $f_\infty(x) = -A(F_\infty(x))$ on the real line with

$$A : [0,1] \to \mathbb{R}_- \ C^1, \text{ negative on } (0,1) \text{ and s.t. } A(0) = A(1) = 0, \ A'(0) < 0, \ A'(1) > 0.$$

The constant $b$ associated with any such density is finite by the proof of Lemma 2.1 [6]. Moreover, in order to investigate the long-time behaviour of the solution $f_t$ of the Fokker-Planck equation

$$\partial_t f_t(x) = \partial_{xx} f_t(x) + \partial_x\left(A'(F_t(x)) f_t(x)\right), \quad (t,x) \in [0,+\infty) \times \mathbb{R}$$

to the density $f_\infty$ such that $\int_{\mathbb{R}} x f_\infty(x)\,dx = \int_{\mathbb{R}} x f_0(x)\,dx$, [6] first investigates the exponential convergence to $0$ of $\int_{\mathbb{R}} \frac{(F_t - F_\infty)^2}{f_\infty}(x)\,dx$ (Lemma 2.8) before dealing with that of $\int_{\mathbb{R}} \frac{(f_t - f_\infty)^2}{f_\infty}(x)\,dx$ (Theorem 2.4).

• Even when $b < +\infty$, it is not possible to control $\int_{\mathbb{R}} \frac{(f-g)^2}{f}(x)\,dx$ in terms of $\int_{\mathbb{R}} \frac{(F-G)^2}{f}(x)\,dx$. Indeed let $f(x) = \frac{e^{-|x|}}{2}$ and

$$\text{for } n \in \mathbb{N}, \quad g_n(x) = f(x) \sum_{k \le n} 1_{[k-1,k)}(|x|) + \sum_{k \ge n} \frac{e^{-|x|/2}}{2} 1_{[x_k,k+1)}(|x|)$$

where $x_k = k+1 - 2\ln\left(1 + \frac{e-1}{2}e^{-\frac{k+1}{2}}\right)$ belongs to $(k,k+1)$ and is such that $\int_{x_k}^{k+1} e^{-x/2}\,dx = \int_k^{k+1} e^{-x}\,dx$. One has, using $\forall y > 0$, $\ln(1+y) \ge \frac{y}{1+y}$ by concavity of the logarithm and $1 + \frac{e-1}{2}e^{-\frac{k+1}{2}} \le \sqrt{e}$ for the inequality,

$$\begin{aligned} \int_{\mathbb{R}} \frac{(f-g_n)^2}{f}(x)\,dx &= 2\int_n^{+\infty} \frac{g_n^2}{f}(x)\,dx - e^{-n} = 2\sum_{k \ge n} \ln\left(1 + \frac{e-1}{2}e^{-\frac{k+1}{2}}\right) - e^{-n} \\ &\ge \frac{e-1}{\sqrt{e}} \sum_{k \ge n} e^{-\frac{k+1}{2}} - e^{-n} = (\sqrt{e}+1)e^{-\frac{n+1}{2}} - e^{-n}. \end{aligned}$$

On the other hand, since for $k \ge n$ and $x \in [k,k+1]$, $1 - \frac{e^{-k}}{2} \le G_n(x) \le F(x) = 1 - \frac{e^{-x}}{2}$,

$$\int_{\mathbb{R}} \frac{(F-G_n)^2}{f}(x)\,dx \le \sum_{k \ge n} \int_k^{k+1} \frac{(e^{-k} - e^{-x})^2}{e^{-x}}\,dx = \frac{e^2 - 2e - 1}{e-1}\,e^{-n}.$$



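Proof of Theorem 1.1: The implication $\mathcal{T}_\chi(C) \Rightarrow \mathcal{P}(C)$ is obtained by linearization of the transport-chi-square inequality $\mathcal{T}_\chi(C)$. For $\nu_\varepsilon = (1 + \varepsilon\phi)\mu$ with $\phi : \mathbb{R}^d \to \mathbb{R}$ a $C^2$ function compactly supported and such that $\int_{\mathbb{R}^d} \phi(x)\,d\mu(x) = 0$, according to [8] p394, there is a finite constant $K$ not depending on $\varepsilon$ such that

$$\int_{\mathbb{R}^d} \phi^2(x)\,d\mu(x) \le \sqrt{\int_{\mathbb{R}^d} |\nabla\phi(x)|^2\,d\mu(x)} \times \frac{W_2(\mu,\nu_\varepsilon)}{\varepsilon} + \frac{K}{\varepsilon} W_2^2(\mu,\nu_\varepsilon).$$

When $\mathcal{T}_\chi(C)$ holds, then $W_2(\mu,\nu_\varepsilon) \le \varepsilon\sqrt{C\int_{\mathbb{R}^d} \phi^2(x)\,d\mu(x)}$ and taking the limit $\varepsilon \to 0$, one deduces that

$$\int_{\mathbb{R}^d} \phi^2(x)\,d\mu(x) \le \sqrt{\int_{\mathbb{R}^d} |\nabla\phi(x)|^2\,d\mu(x)} \times \sqrt{C\int_{\mathbb{R}^d} \phi^2(x)\,d\mu(x)}.$$

This implies $\int_{\mathbb{R}^d} \phi^2(x)\,d\mu(x) \le C\int_{\mathbb{R}^d} |\nabla\phi(x)|^2\,d\mu(x)$. Let now $\varphi, \phi_n : \mathbb{R}^d \to \mathbb{R}$ be $C^2$ functions compactly supported with $\phi_n$ taking its values in $[0,1]$, equal to $1$ on the ball centered at the origin with radius $n$ and $\nabla\phi_n$ bounded by $1$. Taking the limit $n \to \infty$ in the inequality written with $\phi$ replaced by $\varphi_n = \varphi - \phi_n \frac{\int_{\mathbb{R}^d} \varphi(x)\,d\mu(x)}{\int_{\mathbb{R}^d} \phi_n(x)\,d\mu(x)}$, one deduces that the Poincaré inequality $\mathcal{P}(C)$ holds for $\varphi$. The extension to $C^1$ functions $\varphi$ with a bounded gradient is obtained by density.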

To prove the converse implication, we now suppose that $d = 1$, $\mu$ satisfies the Poincaré inequality $\mathcal{P}(C)$ and that $\chi_2(\nu|\mu) < +\infty$. We set $\mu_n = \rho_n * \mu$ and $\nu_n = \rho_n * \nu$ for $n \ge 1$ where

$$\rho_n(x) = \sqrt{\frac{n}{2\pi}}\,e^{-\frac{nx^2}{2}} \tag{1.5}$$

denotes the density of the centered Gaussian law with variance $1/n$. For $\varphi$ a $C^1$ function on $\mathbb{R}$ with a bounded derivative such that $0 = \int_{\mathbb{R}} \varphi(x)\,d\mu_n(x) = \int_{\mathbb{R}} \rho_n * \varphi(x)\,d\mu(x)$, one has

$$\begin{aligned} \int_{\mathbb{R}} \varphi^2(x)\,d\mu_n(x) &= \int_{\mathbb{R}} \left((\rho_n * \varphi^2)(x) - (\rho_n * \varphi)^2(x)\right) d\mu(x) + \int_{\mathbb{R}} (\rho_n * \varphi)^2(x)\,d\mu(x) \\ &\le \int_{\mathbb{R}} \frac{1}{n}(\rho_n * (\varphi')^2)(x)\,d\mu(x) + C\int_{\mathbb{R}} (\rho_n * \varphi')^2(x)\,d\mu(x) \\ &\le \frac{1+nC}{n} \int_{\mathbb{R}} (\rho_n * (\varphi')^2)(x)\,d\mu(x) = \frac{1+nC}{n} \int_{\mathbb{R}} (\varphi')^2(x)\,d\mu_n(x) \end{aligned}$$

where we used the Poincaré inequalities for the Gaussian density $\rho_n$ ([1] Theorem 1.5.1 p10) applied to $\varphi$ and for $\mu$ applied to $\rho_n * \varphi$ for the second inequality, then Jensen's inequality. The probability measure $\mu_n$ admits a positive density w.r.t. the Lebesgue measure and satisfies $\mathcal{P}\left(\frac{1+nC}{n}\right)$. According to Theorem 6.2.2 [1], this property is equivalent to the fact that the constant associated with $\mu_n$ through (1.3) is $b_n \le 2\,\frac{1+nC}{n}$. Combining Propositions 1.4 and 1.6, one deduces that

$$W_2^2(\mu_n,\nu_n) \le 32\,\frac{1+nC}{n}\,\chi_2^2(\nu_n|\mu_n).$$
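To conclude, let us check that $W_2(\mu,\nu) \le \liminf_{n\to\infty} W_2(\mu_n,\nu_n)$ and that $\chi_2^2(\nu_n|\mu_n) \le \chi_2^2(\nu|\mu)$. First, the probability measures $\mu_n$ with c.d.f. $F_n(x) = \mu_n((-\infty,x])$ (resp. $\nu_n$ with c.d.f. $G_n(x) = \nu_n((-\infty,x])$) converge weakly to $\mu$ (resp. $\nu$), which ensures that $du$ a.e. on $(0,1)$, $(F_n^{-1}(u), G_n^{-1}(u))$ tends to $(F^{-1}(u), G^{-1}(u))$ as $n \to \infty$. With Fatou lemma, one deduces that

$$W_2^2(\mu,\nu) = \int_0^1 (F^{-1}(u) - G^{-1}(u))^2\,du \le \liminf_{n\to\infty} \int_0^1 (F_n^{-1}(u) - G_n^{-1}(u))^2\,du = \liminf_{n\to\infty} W_2^2(\mu_n,\nu_n).$$

On the other hand, by Jensen's inequality,

$$\begin{aligned} \chi_2^2(\nu_n|\mu_n) &= \int_{\mathbb{R}} \left(\frac{\int_{\mathbb{R}} \left(\frac{d\nu}{d\mu}(y) - 1\right)\rho_n(x-y)\,d\mu(y)}{\int_{\mathbb{R}} \rho_n(x-y)\,d\mu(y)}\right)^2 \int_{\mathbb{R}} \rho_n(x-y)\,d\mu(y)\,dx \\ &\le \int_{\mathbb{R}} \int_{\mathbb{R}} \left(\frac{d\nu}{d\mu}(y) - 1\right)^2 \rho_n(x-y)\,d\mu(y)\,dx = \chi_2^2(\nu|\mu). \end{aligned}$$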
Remark 1.8 Since

$$W_2^2(\mu_n,\nu_n) \le \inf_{\gamma} \int_{\mathbb{R}^3} ((x+z) - (y+z))^2\,d\gamma(x,y)\,\rho_n(z)\,dz = W_2^2(\mu,\nu),$$

one has $\lim_{n\to\infty} W_2(\mu_n,\nu_n) = W_2(\mu,\nu)$.

Moreover, when $\chi_2(\nu|\mu) < +\infty$, then interpreting $\mu_n$ (resp. $\nu_n$) as the distribution at time $\frac{1}{n}$ of a Brownian motion initially distributed according to $\mu$ (resp. $\nu$) and using Theorem 1.7, one obtains $\lim_{n\to+\infty} \chi_2^2(\nu_n|\mu_n) = \chi_2^2(\nu|\mu)$.



2 Proof of Proposition 1.4

To prove the proposition, one first needs to express the Wasserstein distance in terms of the cumulative distribution functions $F$ and $G$ instead of their pseudo-inverses:

Lemma 2.1

$$W_2^2(\mu,\nu) = \int_{\mathbb{R}^2} \left((F(x \wedge y) - G(x \vee y))^+ + (G(x \wedge y) - F(x \vee y))^+\right) dy\,dx. \tag{2.1}$$



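Proof of Lemma 2.1: Let us first suppose that $\mu$ admits a positive continuous density $f$ w.r.t. the Lebesgue measure. Using the change of variables $(v,w) = (F(x), F(y))$ for the third equality, then the equivalence $w < F(G^{-1}(u)) \Leftrightarrow F^{-1}(w) < G^{-1}(u) \Leftrightarrow G(F^{-1}(w)) < u$ deduced from (1.1), one obtains

$$\begin{aligned} W_2^2(\mu,\nu) &= \int_0^1 (G^{-1}(u) - F^{-1}(u))^2\,du \\ &= 2\int_0^1 \int_{\mathbb{R}^2} \left(1_{\{F^{-1}(u) \le x \le y \le F^{-1}(F(G^{-1}(u)))\}} + 1_{\{F^{-1}(F(G^{-1}(u))) \le x \le y \le F^{-1}(u)\}}\right) dx\,dy\,du \\ &= 2\int_{[0,1]^3} \left(1_{\{u \le v \le w \le F(G^{-1}(u))\}} + 1_{\{F(G^{-1}(u)) \le v \le w \le u\}}\right) \frac{dv\,dw}{f(F^{-1}(v))f(F^{-1}(w))}\,du \\ &= 2\int_0^1 \int_v^1 \int_0^1 \left(1_{\{G(F^{-1}(w)) < u \le v\}} + 1_{\{w \le u \le G(F^{-1}(v))\}}\right) du\,\frac{dw\,dv}{f(F^{-1}(v))f(F^{-1}(w))} \\ &= 2\int_0^1 \int_v^1 \left((v - G(F^{-1}(w)))^+ + (G(F^{-1}(v)) - w)^+\right) \frac{dw\,dv}{f(F^{-1}(v))f(F^{-1}(w))} \\ &= 2\int_{\mathbb{R}} \int_x^{+\infty} \left((F(x) - G(y))^+ + (G(x) - F(y))^+\right) dy\,dx. \end{aligned} \tag{2.2}$$

By symmetry, one deduces that (2.1) holds.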

In the general case, one approximates $\mu$ and $\nu$ by the probability measures $\mu_n = \rho_n * \mu$ and $\nu_n = \rho_n * \nu$ (see (1.5) for the definition of $\rho_n$), which admit smooth positive densities w.r.t. the Lebesgue measure. Let $F_n(x) = \mu_n((-\infty,x])$ and $G_n(x) = \nu_n((-\infty,x])$ denote the associated c.d.f. One has $\lim_{n\to\infty} W_2(\mu_n,\nu_n) = W_2(\mu,\nu)$ according to Remark 1.8. Moreover, by the weak convergence of $\mu_n$ to $\mu$ and $\nu_n$ to $\nu$, $dx$ a.e. on $\mathbb{R}$, $(F_n(x), G_n(x))$ tends to $(F(x), G(x))$. Since, by Jensen's inequality,

$$(F_n(x) - G_n(y))^+ = \left(\int_{\mathbb{R}} (F(x-z) - G(y-z))\rho_n(z)\,dz\right)^+ \le \int_{\mathbb{R}} (F(x-z) - G(y-z))^+ \rho_n(z)\,dz,$$

the right-hand-side of (2.2) gets smaller when replacing $(F,G)$ by $(F_n,G_n)$ and tends to the expression with $(F,G)$ as $n \to \infty$ by Fatou lemma. Hence (2.2) still holds. ■



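Proof of Proposition 1.4: One has

$$\int_x^{+\infty} (F(x) - G(y))^+\,dy = 1_{\{F(x) > G(x)\}} \int_x^{G^{-1}(F(x))} (F(x) - G(y))\,dy \le (F(x) - G(x))^+ (G^{-1}(F(x)) - x). \tag{2.3}$$

By Fubini's theorem and a similar argument,

$$\begin{aligned} \int_{\mathbb{R}} \int_x^{+\infty} (G(x) - F(y))^+\,dy\,dx &= \int_{\mathbb{R}} \int_{-\infty}^x (G(y) - F(x))^+\,dy\,dx \\ &\le \int_{\mathbb{R}} (G(x) - F(x))^+ (x - G^{-1}(F(x)))\,dx. \end{aligned}$$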

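With (2.2) and (2.3), then using Cauchy-Schwarz inequality and the change of variables $u = F(x)$, one deduces that when $\mu$ admits a positive density $f$ w.r.t. the Lebesgue measure, then

$$\begin{aligned} W_2^2(\mu,\nu) &\le 2\int_{\mathbb{R}} |G(x) - F(x)|\,|x - G^{-1}(F(x))|\,dx \\ &\le 2\left(\int_{\mathbb{R}} \frac{(G(x) - F(x))^2}{f(x)}\,dx\right)^{1/2} \times \left(\int_{\mathbb{R}} (x - G^{-1}(F(x)))^2 f(x)\,dx\right)^{1/2}. \end{aligned}$$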
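Recognizing that the second factor in the r.h.s. is equal to $W_2(\mu,\nu)$, one concludes that (1.2) holds as soon as $W_2(\mu,\nu) < +\infty$. To prove (1.2) without assuming finiteness of $W_2(\mu,\nu)$, one defines a sequence $(G_n)_n$ of cumulative distribution functions converging pointwise to $G$ by setting

$$G_n(x) = \begin{cases} F(x) \wedge \frac{1}{n} & \text{if } x < G^{-1}\left(\frac{1}{n}\right) \\ G(x) & \text{if } x \in \left[G^{-1}\left(\frac{1}{n}\right), G^{-1}\left(\frac{n-1}{n}\right)\right) \\ F(x) \vee \frac{n-1}{n} & \text{if } x \ge G^{-1}\left(\frac{n-1}{n}\right). \end{cases}$$

For $x < G^{-1}\left(\frac{1}{n}\right)$, $G(x) < \frac{1}{n}$ and $|F(x) - G_n(x)| = \left(F(x) - \frac{1}{n}\right)^+ \le \min\left(|F(x) - G(x)|, \left(F(x) - \frac{1}{n+1}\right)^+\right) \le |F(x) - G_{n+1}(x)|$. Similarly, for $x \ge G^{-1}\left(\frac{n-1}{n}\right)$, $G(x) \ge \frac{n-1}{n}$ and $|F(x) - G_n(x)| = \left(\frac{n-1}{n} - F(x)\right)^+ \le \min\left(|F(x) - G(x)|, \left(\frac{n}{n+1} - F(x)\right)^+\right) \le |F(x) - G_{n+1}(x)|$. As a consequence, for fixed $x \in \mathbb{R}$, the sequence $(|G_n(x) - F(x)|)_{n \in \mathbb{N}}$ is non-decreasing and goes to $|G(x) - F(x)|$ as $n \to \infty$. By monotone convergence, one deduces that $\lim_{n\to+\infty} \int_{\mathbb{R}} \frac{(G_n - F)^2}{f}(x)\,dx = \int_{\mathbb{R}} \frac{(G - F)^2}{f}(x)\,dx$. Moreover,

$$G_n^{-1}(u) = \begin{cases} F^{-1}(u) \wedge G^{-1}\left(\frac{1}{n}\right) & \text{if } u \le \frac{1}{n} \\ G^{-1}(u) & \text{if } u \in \left(\frac{1}{n}, \frac{n-1}{n}\right] \\ F^{-1}(u) \vee G^{-1}\left(\frac{n-1}{n}\right) & \text{if } u > \frac{n-1}{n}. \end{cases}$$

As a consequence, denoting by $\nu_n$ the probability measure with c.d.f. $G_n$, $W_2^2(\mu,\nu_n) = \int_0^1 (F^{-1}(u) - G_n^{-1}(u))^2\,du < +\infty$ and $W_2^2(\mu,\nu) \le \liminf_{n\to\infty} W_2^2(\mu,\nu_n)$ by Fatou Lemma. One concludes by taking the limit $n \to +\infty$ in (1.2) written with $(\nu_n, G_n)$ replacing $(\nu, G)$. ■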
3 Proof of Proposition 1.6

Let us assume that $b < +\infty$ and $\int_{\mathbb{R}} \frac{(f-g)^2}{f}(x)\,dx < +\infty$. By integration by parts, for $n \in \mathbb{N}^*$,

$$\int_{-n}^n \frac{(F-G)^2}{f}(x)\,dx = \left[(F-G)^2(x) \int_m^x \frac{dy}{f(y)}\right]_{-n}^n - 2\int_{-n}^n (F-G)(f-g)(x) \int_m^x \frac{dy}{f(y)}\,dx. \tag{3.1}$$



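For $x$ larger than the median $m$ of the density $f$, by definition of $b$, then by the equality $(F-G)(x) = \int_x^{+\infty} (g-f)(y)\,dy$ and Cauchy-Schwarz inequality, one has

$$0 \le (F-G)^2(x) \int_m^x \frac{dy}{f(y)} \le b\,\frac{(F-G)^2(x)}{\int_x^{+\infty} f(y)\,dy} = b\,\frac{\left(\int_x^{+\infty} (f-g)(y)\,dy\right)^2}{\int_x^{+\infty} f(y)\,dy} \le b \int_x^{+\infty} \frac{(f-g)^2}{f}(y)\,dy,$$

where the right-hand-side tends to $0$ as $x \to +\infty$ by integrability of $\frac{(f-g)^2}{f}$ on the real line. Similarly, $\lim_{x\to-\infty} (F-G)^2(x) \int_x^m \frac{dy}{f(y)} = 0$. Taking the limit $n \to \infty$ in (3.1) and using again the definition of $b$, one deduces that

$$\int_{\mathbb{R}} \frac{(F-G)^2}{f}(x)\,dx \le 2b \int_{\mathbb{R}} |(F-G)(f-g)|(x) \left(\frac{1_{\{x \ge m\}}}{\int_x^{+\infty} f(y)\,dy} + \frac{1_{\{x < m\}}}{\int_{-\infty}^x f(y)\,dy}\right) dx. \tag{3.2}$$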



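The product $|(F-G)(f-g)|(x) \times \left(\frac{1_{\{x \ge m\}}}{\int_x^{+\infty} f(y)\,dy} + \frac{1_{\{x < m\}}}{\int_{-\infty}^x f(y)\,dy}\right)$ is locally integrable on $\mathbb{R}$ since the first factor is integrable and the second one is locally bounded. Let $a_n < +\infty$ denote the integral of this function on $[-n,n]$.

By Cauchy-Schwarz inequality,

$$a_n \le \left(\int_{\mathbb{R}} \frac{(f-g)^2}{f}(x)\,dx\right)^{1/2} \times \left(\int_{-n}^n f(F-G)^2(x) \left(\frac{1_{\{x \ge m\}}}{\left(\int_x^{+\infty} f(y)\,dy\right)^2} + \frac{1_{\{x < m\}}}{\left(\int_{-\infty}^x f(y)\,dy\right)^2}\right) dx\right)^{1/2}. \tag{3.3}$$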



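Now, setting $\varepsilon_n = \frac{(F-G)^2(n)}{\int_n^{+\infty} f(y)\,dy} + \frac{(F-G)^2(-n)}{\int_{-\infty}^{-n} f(y)\,dy}$, we obtain by integration by parts that for $n \ge |m|$,

$$\begin{aligned} &\int_{-n}^n f(F-G)^2(x) \left(\frac{1_{\{x \ge m\}}}{\left(\int_x^{+\infty} f(y)\,dy\right)^2} + \frac{1_{\{x < m\}}}{\left(\int_{-\infty}^x f(y)\,dy\right)^2}\right) dx \\ &\quad = \left[\frac{(F-G)^2(x)}{\int_x^{+\infty} f(y)\,dy}\right]_m^n - 2\int_m^n \frac{(F-G)(f-g)(x)}{\int_x^{+\infty} f(y)\,dy}\,dx + \left[-\frac{(F-G)^2(x)}{\int_{-\infty}^x f(y)\,dy}\right]_{-n}^m + 2\int_{-n}^m \frac{(F-G)(f-g)(x)}{\int_{-\infty}^x f(y)\,dy}\,dx \\ &\quad = -4(F-G)^2(m) + \varepsilon_n - 2\int_{-n}^n (F-G)(f-g)(x) \left(\frac{1_{\{x \ge m\}}}{\int_x^{+\infty} f(y)\,dy} - \frac{1_{\{x < m\}}}{\int_{-\infty}^x f(y)\,dy}\right) dx \\ &\quad \le 2a_n + \varepsilon_n. \end{aligned}$$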

Plugging this estimation in (3.3), one deduces that

$$\forall n \ge |m|, \quad a_n \le 1_{\{a_n > 0\}} \left(2 + \frac{\varepsilon_n}{a_n}\right) \int_{\mathbb{R}} \frac{(f-g)^2}{f}(x)\,dx.$$



Using that, according to the analysis of the boundary terms in the first integration by parts performed in the proof, $\lim_{n\to+\infty} \varepsilon_n = 0$ and that $(a_n)_n$ is non-decreasing, one may take the limit $n \to \infty$ in this inequality to obtain

$$\int_{\mathbb{R}} |(F-G)(f-g)|(x) \left(\frac{1_{\{x \ge m\}}}{\int_x^{+\infty} f(y)\,dy} + \frac{1_{\{x < m\}}}{\int_{-\infty}^x f(y)\,dy}\right) dx \le 2\int_{\mathbb{R}} \frac{(f-g)^2}{f}(x)\,dx.$$

One easily concludes with (3.2). ■
4 Proof of Theorem 1.2

Let $\nu$ be a probability measure on $\mathbb{R}^{d_1} \times \mathbb{R}^{d_2}$ with respective marginals $\nu_1$ and $\nu_2$ and such that $\chi_2(\nu|\mu_1 \otimes \mu_2) < +\infty$, let $\rho$ denote the Radon-Nikodym derivative $\frac{d\nu}{d(\mu_1 \otimes \mu_2)}$ and, for $x_1 \in \mathbb{R}^{d_1}$, let $\rho_1(x_1) = \int_{\mathbb{R}^{d_2}} \rho(x_1,x_2)\,d\mu_2(x_2)$. Notice that

$$\chi_2^2(\nu|\mu_1 \otimes \mu_2) = \int_{\mathbb{R}^{d_1+d_2}} (\rho(x_1,x_2) - 1)^2\,d\mu_1(x_1)\,d\mu_2(x_2).$$

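According to the tensorization property of transport costs (see for instance Proposition A.1 [5]),

$$W_2^2(\mu_1 \otimes \mu_2, \nu) \le W_2^2(\mu_1,\nu_1) + \int_{\mathbb{R}^{d_1}} 1_{\{\rho_1(x_1) > 0\}} W_2^2\left(\mu_2, \frac{\rho(x_1,\cdot)}{\rho_1(x_1)}\mu_2\right) d\nu_1(x_1). \tag{4.1}$$

By the inequality $\mathcal{T}_\chi(C_1)$ satisfied by $\mu_1$, the equality $\frac{d\nu_1}{d\mu_1}(x_1) = \rho_1(x_1) = \int_{\mathbb{R}^{d_2}} \rho(x_1,x_2)\,d\mu_2(x_2)$ and Jensen's inequality, one has

$$W_2^2(\mu_1,\nu_1) \le C_1\chi_2^2(\nu_1|\mu_1) = C_1 \int_{\mathbb{R}^{d_1}} (\rho_1(x_1) - 1)^2\,d\mu_1(x_1) \le C_1\chi_2^2(\nu|\mu_1 \otimes \mu_2). \tag{4.2}$$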

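So the first term of the right-hand-side of (4.1) is controlled by $\chi_2^2(\nu|\mu_1 \otimes \mu_2)$. By the inequality $\mathcal{T}_\chi(C_2)$ satisfied by $\mu_2$, when $\rho_1(x_1) > 0$, $W_2^2\left(\mu_2, \frac{\rho(x_1,\cdot)}{\rho_1(x_1)}\mu_2\right) \le C_2 \int_{\mathbb{R}^{d_2}} \left(\frac{\rho(x_1,x_2)}{\rho_1(x_1)} - 1\right)^2 d\mu_2(x_2)$. Unfortunately, there is no hope to control

$$\begin{aligned} &\int_{\mathbb{R}^{d_1+d_2}} 1_{\{\rho_1(x_1) > 0\}} \left(\frac{\rho(x_1,x_2)}{\rho_1(x_1)} - 1\right)^2 d\nu_1(x_1)\,d\mu_2(x_2) \\ &\quad = \int_{\mathbb{R}^{d_1+d_2}} 1_{\{\rho_1(x_1) > 0\}} \left(\frac{\rho(x_1,x_2)}{\rho_1(x_1)} - 1\right)^2 \rho_1(x_1)\,d\mu_1(x_1)\,d\mu_2(x_2) \end{aligned}$$

in terms of $\chi_2^2(\nu|\mu_1 \otimes \mu_2)$ because of the possible very small values of $\rho_1(x_1)$. Therefore it is not enough to plug the latter inequality into the right-hand-side of (4.1) to conclude that $\mu_1 \otimes \mu_2$ satisfies a transport-chi-square inequality. So we are only going to use this inequality for $\rho_1(x_1) \ge \frac{1}{\alpha}$ where $\alpha$ is some constant larger than $1$ to be optimized at the end of the proof. Using Lemma 4.1 below with $\beta = \alpha$, one obtains

$$\begin{aligned} \int_{\mathbb{R}^{d_1}} 1_{\{\rho_1(x_1) \ge \frac{1}{\alpha}\}} W_2^2\left(\mu_2, \frac{\rho(x_1,\cdot)}{\rho_1(x_1)}\mu_2\right) d\nu_1(x_1) &\le C_2 \int_{\mathbb{R}^{d_1+d_2}} \left(\frac{\rho(x_1,x_2)}{\rho_1(x_1)} - 1\right)^2 1_{\{\rho_1(x_1) \ge \frac{1}{\alpha}\}}\,d\nu_1(x_1)\,d\mu_2(x_2) \\ &\le \alpha C_2 \int_{\mathbb{R}^{d_1+d_2}} (\rho(x_1,x_2) - 1)^2 1_{\{\rho_1(x_1) \ge \frac{1}{\alpha}\}}\,d\mu_1(x_1)\,d\mu_2(x_2). \end{aligned} \tag{4.3}$$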



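For small positive values of $\rho_1$, we use the estimation of $W_2^2\left(\mu_2, \frac{\rho(x_1,\cdot)}{\rho_1(x_1)}\mu_2\right)$ deduced from the optimal coupling for the total variation distance. If $\frac{\rho(x_1,\cdot)}{\rho_1(x_1)}\mu_2 \ne \mu_2$, let $\varepsilon$ denote a Bernoulli random variable with parameter $p = \int_{\mathbb{R}^{d_2}} \left(\frac{\rho(x_1,x_2)}{\rho_1(x_1)} \wedge 1\right) d\mu_2(x_2)$ and $(X,Y,Z)$ denote an independent $\mathbb{R}^{d_2} \times \mathbb{R}^{d_2} \times \mathbb{R}^{d_2}$-valued random vector with $X$, $Y$ and $Z$ respectively distributed according to $\frac{1}{p}\left(\frac{\rho(x_1,x_2)}{\rho_1(x_1)} \wedge 1\right) d\mu_2(x_2)$, $\frac{1}{1-p}\left(1 - \frac{\rho(x_1,x_2)}{\rho_1(x_1)}\right)^+ d\mu_2(x_2)$ and $\frac{1}{1-p}\left(\frac{\rho(x_1,x_2)}{\rho_1(x_1)} - 1\right)^+ d\mu_2(x_2)$. The random variables $\varepsilon X + (1-\varepsilon)Y$ and $\varepsilon X + (1-\varepsilon)Z$ are respectively distributed according to $d\mu_2(x_2)$ and $\frac{\rho(x_1,x_2)}{\rho_1(x_1)}\,d\mu_2(x_2)$. As a consequence,

$$\begin{aligned} W_2^2\left(\mu_2, \frac{\rho(x_1,\cdot)}{\rho_1(x_1)}\mu_2\right) &\le \mathbb{E}\left((1-\varepsilon)^2 |Y-Z|^2\right) = (1-p)\,\mathbb{E}\left(|Y-Z|^2\right) \\ &\le 2(1-p)\left(\mathbb{E}\left|Y - \int_{\mathbb{R}^{d_2}} y_2\,d\mu_2(y_2)\right|^2 + \mathbb{E}\left|Z - \int_{\mathbb{R}^{d_2}} y_2\,d\mu_2(y_2)\right|^2\right) \\ &= 2\int_{\mathbb{R}^{d_2}} \left|x_2 - \int_{\mathbb{R}^{d_2}} y_2\,d\mu_2(y_2)\right|^2 \left|\frac{\rho(x_1,x_2)}{\rho_1(x_1)} - 1\right| d\mu_2(x_2). \end{aligned}$$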

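One deduces

$$\begin{aligned} &\int_{\mathbb{R}^{d_1}} 1_{\{\rho_1(x_1) < \frac{1}{\alpha}\}} W_2^2\left(\mu_2, \frac{\rho(x_1,\cdot)}{\rho_1(x_1)}\mu_2\right) d\nu_1(x_1) \\ &\quad \le 2\int_{\mathbb{R}^{d_1+d_2}} \left|x_2 - \int_{\mathbb{R}^{d_2}} y_2\,d\mu_2(y_2)\right|^2 |\rho(x_1,x_2) - \rho_1(x_1)|\,1_{\{\rho_1(x_1) < \frac{1}{\alpha}\}}\,d\mu_1(x_1)\,d\mu_2(x_2) \\ &\quad \le 2\left(\int_{\mathbb{R}^{d_1+d_2}} \left|x_2 - \int_{\mathbb{R}^{d_2}} y_2\,d\mu_2(y_2)\right|^4 1_{\{\rho_1(x_1) < \frac{1}{\alpha}\}}\,d\mu_1(x_1)\,d\mu_2(x_2)\right)^{1/2} \times \left(\int_{\mathbb{R}^{d_1+d_2}} (\rho(x_1,x_2) - \rho_1(x_1))^2 1_{\{\rho_1(x_1) < \frac{1}{\alpha}\}}\,d\mu_1(x_1)\,d\mu_2(x_2)\right)^{1/2} \\ &\quad \le 2C_2\sqrt{(3d_2+2)d_2} \left(\int_{\mathbb{R}^{d_1}} \frac{\alpha^2(\rho_1(x_1) - 1)^2}{(\alpha-1)^2} 1_{\{\rho_1(x_1) < \frac{1}{\alpha}\}}\,d\mu_1(x_1)\right)^{1/2} \times \left(\int_{\mathbb{R}^{d_1+d_2}} \left[(\rho(x_1,x_2) - 1)^2 - (\rho_1(x_1) - 1)^2\right] 1_{\{\rho_1(x_1) < \frac{1}{\alpha}\}}\,d\mu_1(x_1)\,d\mu_2(x_2)\right)^{1/2} \\ &\quad \le \frac{C_2\,\alpha\sqrt{(3d_2+2)d_2}}{\alpha-1} \int_{\mathbb{R}^{d_1+d_2}} (\rho(x_1,x_2) - 1)^2 1_{\{\rho_1(x_1) < \frac{1}{\alpha}\}}\,d\mu_1(x_1)\,d\mu_2(x_2), \end{aligned}$$

where we used Cauchy-Schwarz inequality for the second inequality, then Lemma 4.2 below and an explicit computation of the third factor for the third inequality, and last the inequality $\sqrt{b}\sqrt{a-b} \le \frac{a}{2}$ for any $a \ge b \ge 0$.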

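Inserting this estimation together with (4.2) and (4.3) into (4.1), one obtains

$$W_2^2(\mu_1 \otimes \mu_2, \nu) \le C_1\chi_2^2(\nu_1|\mu_1) + C_2\left(\alpha \vee \frac{\alpha\sqrt{(3d_2+2)d_2}}{\alpha-1}\right)\chi_2^2(\nu|\mu_1 \otimes \mu_2).$$

For the optimal choice $\alpha = 1 + \sqrt{(3d_2+2)d_2}$, one concludes that the measure $\mu_1 \otimes \mu_2$ satisfies $\mathcal{T}_\chi\left(C_1 + C_2\left(1 + \sqrt{(3d_2+2)d_2}\right)\right)$. Exchanging the roles of $\mu_1$ and $\mu_2$ in the above reasoning, one obtains that $\mu_1 \otimes \mu_2$ also satisfies $\mathcal{T}_\chi\left(C_2 + C_1\left(1 + \sqrt{(3d_1+2)d_1}\right)\right)$.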



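Lemma 4.1 For $\beta \ge \alpha > 0$,

$$\begin{aligned} &\int_{\mathbb{R}^{d_1+d_2}} \left(\frac{\rho(x_1,x_2)}{\rho_1(x_1)} - 1\right)^2 1_{\{\rho_1(x_1) \ge \frac{1}{\alpha}\}}\,d\nu_1(x_1)\,d\mu_2(x_2) + \beta \int_{\mathbb{R}^{d_1}} (\rho_1(x_1) - 1)^2 1_{\{\rho_1(x_1) \ge \frac{1}{\alpha}\}}\,d\mu_1(x_1) \\ &\quad \le \beta \int_{\mathbb{R}^{d_1+d_2}} (\rho(x_1,x_2) - 1)^2 1_{\{\rho_1(x_1) \ge \frac{1}{\alpha}\}}\,d\mu_1(x_1)\,d\mu_2(x_2). \end{aligned}$$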



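Proof: Developing the squares and using the definition of $\rho_1$ and the equality $d\nu_1(x_1) = \rho_1(x_1)\,d\mu_1(x_1)$, one checks that the difference between the right-hand-side and the first term of the left-hand-side is equal to

$$\int_{\mathbb{R}^{d_1}} 1_{\{\rho_1(x_1) \ge \frac{1}{\alpha}\}} \left[\left(\beta - \frac{1}{\rho_1(x_1)}\right) \int_{\mathbb{R}^{d_2}} \rho^2(x_1,x_2)\,d\mu_2(x_2) + (1 - 2\beta)\rho_1(x_1) + \beta\right] d\mu_1(x_1).$$

One easily concludes by remarking that the first integral is restricted to the $x_1 \in \mathbb{R}^{d_1}$ such that $\frac{1}{\rho_1(x_1)} \le \alpha \le \beta$ and that $\int_{\mathbb{R}^{d_2}} \rho^2(x_1,x_2)\,d\mu_2(x_2) \ge \left(\int_{\mathbb{R}^{d_2}} \rho(x_1,x_2)\,d\mu_2(x_2)\right)^2 = \rho_1^2(x_1)$. ■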



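Lemma 4.2 If a probability measure $\mu$ on $\mathbb{R}^d$ satisfies $\mathcal{T}_\chi(C)$, then

$$\int_{\mathbb{R}^d} \left|x - \int_{\mathbb{R}^d} y\,d\mu(y)\right|^2 d\mu(x) \le dC \quad \text{and} \quad \int_{\mathbb{R}^d} \left|x - \int_{\mathbb{R}^d} y\,d\mu(y)\right|^4 d\mu(x) \le (3d+2)dC^2.$$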



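Proof: According to Theorem 1.1, $\mu$ satisfies $\mathcal{P}(C)$. By spatial translation, one may assume that $\int_{\mathbb{R}^d} y\,d\mu(y) = 0$. Applying the Poincaré inequality $\mathcal{P}(C)$ to the functions $x = (x_1,\ldots,x_d) \in \mathbb{R}^d \mapsto x_i$, $x \mapsto x_i^2$ and $x \mapsto x_i x_j$ with $1 \le i \ne j \le d$ yields

$$\int_{\mathbb{R}^d} x_i^2\,d\mu(x) \le C,$$

$$\int_{\mathbb{R}^d} x_i^4\,d\mu(x) \le 4C \int_{\mathbb{R}^d} x_i^2\,d\mu(x) + \left(\int_{\mathbb{R}^d} x_i^2\,d\mu(x)\right)^2 \le 5C^2,$$

$$\int_{\mathbb{R}^d} (x_i x_j)^2\,d\mu(x) \le C \int_{\mathbb{R}^d} (x_i^2 + x_j^2)\,d\mu(x) + \left(\int_{\mathbb{R}^d} x_i x_j\,d\mu(x)\right)^2 \le 2C^2 + \int_{\mathbb{R}^d} x_i^2\,d\mu(x) \int_{\mathbb{R}^d} x_j^2\,d\mu(x) \le 3C^2.$$

One easily concludes by summation of these inequalities. ■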
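
The summation step can be written out as follows. This is a sketch that assumes the centering $\int_{\mathbb{R}^d} y\,d\mu(y) = 0$ and the coordinate-wise bounds $\int x_i^2\,d\mu \le C$, $\int x_i^4\,d\mu \le 5C^2$ and $\int x_i^2x_j^2\,d\mu \le 3C^2$ for $i \ne j$ coming from the three applications of the Poincaré inequality.

```latex
% Assumption: \mu is centered, and the three coordinate-wise bounds hold.
\begin{align*}
\int_{\mathbb{R}^d} |x|^2 \, d\mu(x)
  &= \sum_{i=1}^d \int_{\mathbb{R}^d} x_i^2 \, d\mu(x) \le dC, \\
\int_{\mathbb{R}^d} |x|^4 \, d\mu(x)
  &= \sum_{i=1}^d \int_{\mathbb{R}^d} x_i^4 \, d\mu(x)
   + \sum_{1 \le i \ne j \le d} \int_{\mathbb{R}^d} x_i^2 x_j^2 \, d\mu(x) \\
  &\le 5dC^2 + 3d(d-1)C^2 = (3d+2)\,d\,C^2 .
\end{align*}
```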